# Lightweight Pretraining
**Arsh Llm** · arshiaafshani · MIT · Large Language Model · Downloads: 162 · Likes: 3
Arsh LLM is an open-source large language model designed for research. It was pretrained on the OLMo mixed dataset on a single T4 GPU, with a total training time of approximately 4-5 days.
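A minimal usage sketch with the Hugging Face Transformers library, assuming the checkpoint is a standard causal language model; the repository ID `arshiaafshani/Arsh-llm` is an assumption and should be checked against the actual model card.

```python
# Hypothetical sketch: load Arsh LLM with Transformers and generate a short continuation.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "arshiaafshani/Arsh-llm"  # assumed repository ID; verify on the hub

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

inputs = tokenizer("Large language models are", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```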
**Tinymistral 248M** · Locutusque · Apache-2.0 · Large Language Model · Transformers, English · Downloads: 1,127 · Likes: 46
A language model scaled down from Mistral 7B to 248 million parameters, designed for text generation and suitable for fine-tuning on downstream tasks.
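A short text-generation sketch using the Transformers pipeline API; the repository ID `Locutusque/TinyMistral-248M` is inferred from the author and model name above and should be verified before use.

```python
# Sketch: text generation with TinyMistral-248M via the text-generation pipeline.
from transformers import pipeline

# Repo ID inferred from the listing; verify it on the hub before relying on it.
generator = pipeline("text-generation", model="Locutusque/TinyMistral-248M")
result = generator("Once upon a time,", max_new_tokens=40, do_sample=True)
print(result[0]["generated_text"])
```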
**Sew Tiny 100k** · asapp · Apache-2.0 · Speech Recognition · Transformers, multilingual · Downloads: 1,080 · Likes: 3
SEW-tiny is a compressed, efficient speech pretraining model from ASAPP Research, pretrained on 16 kHz sampled speech audio and suitable for a variety of downstream speech tasks.
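Because this is a pretrained speech encoder rather than a finished ASR system, a plausible use is extracting hidden-state features from 16 kHz audio for downstream tasks. The sketch below assumes the repository ID `asapp/sew-tiny-100k` and the standard SEW integration in Transformers.

```python
# Sketch: extract hidden-state features from 16 kHz audio with SEW-tiny.
# The base checkpoint has no ASR head, so it is used here as a feature extractor.
import numpy as np
import torch
from transformers import AutoFeatureExtractor, SEWModel

repo_id = "asapp/sew-tiny-100k"  # assumed repository ID

feature_extractor = AutoFeatureExtractor.from_pretrained(repo_id)
model = SEWModel.from_pretrained(repo_id)

waveform = np.zeros(16000, dtype=np.float32)  # 1 second of dummy 16 kHz audio
inputs = feature_extractor(waveform, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    hidden_states = model(**inputs).last_hidden_state
print(hidden_states.shape)  # (batch, frames, hidden_size)
```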
**Bert L12 H256 A4** · eli4s · Large Language Model · Transformers · Downloads: 17 · Likes: 0
A lightweight BERT model pretrained using knowledge distillation, with a hidden dimension of 256 and 4 attention heads, suitable for masked language modeling tasks.
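A hedged sketch of masked-token prediction with this distilled BERT; the repository ID `eli4s/Bert-L12-h256-A4` is guessed from the author and model name above and may differ on the hub, and the checkpoint is assumed to ship with a masked-LM head.

```python
# Sketch: masked-token prediction with the distilled BERT variant above.
# The repo ID is an assumption pieced together from the listing.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="eli4s/Bert-L12-h256-A4")
for prediction in fill_mask("The capital of France is [MASK]."):
    print(prediction["token_str"], round(prediction["score"], 3))
```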
**Mengzi Oscar Base Caption** · Langboat · Apache-2.0 · Image-to-Text · Transformers, Chinese · Downloads: 23 · Likes: 2
A Chinese multimodal image captioning model, based on the Mengzi-Oscar pretrained model and fine-tuned on the AIC-ICC Chinese image caption dataset.
**Bert Base Arabic Camelbert Msa Sixteenth** · CAMeL-Lab · Apache-2.0 · Large Language Model · Arabic · Downloads: 215 · Likes: 4
A pretrained model for Arabic NLP tasks, trained on a reduced-scale (1/16) Modern Standard Arabic (MSA) dataset.
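A small sketch of Arabic masked language modeling with the fill-mask pipeline; the repository ID `CAMeL-Lab/bert-base-arabic-camelbert-msa-sixteenth` follows the naming above and is assumed to expose a masked-LM head.

```python
# Sketch: Arabic masked language modeling with the CAMeLBERT MSA (1/16) checkpoint.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="CAMeL-Lab/bert-base-arabic-camelbert-msa-sixteenth")
# The prompt means "The goal of life is [MASK]."
for prediction in unmasker("الهدف من الحياة هو [MASK] ."):
    print(prediction["token_str"], round(prediction["score"], 3))
```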